Adaptive Control of a Partially Observed Controlled Markov Chain
نویسنده
چکیده
We consider an adaptive i~nite state controlled Maxkov chain with partial state information, motivated by a class of replacement problems. We present parameter estimation techniques based on the information ava~lable after actions that reset the state to a known v~ue are taken. We prove that the parameter estimates converge w.p.1 to the true (unknown) parameter, under the feedback structure induced by a certainty equivalent adaptive policy. We also show that the adaptive policy is self-optimizing, in a long-run average sense, for any (measurable) sequence of parameter estimates converging w.p.1 to the true parameter. * This work was supported in part by the Texas Advanced TechnoloLrD ' Program under Grant No. 003658093, in part by the Air Force OfFice of Scientific Research under Grants AFOSRo91-0033, F49620-92-J-0045, and F49620-92-J-0083, and in part by the Nation ~I Science Foundation under Grant CDR-8803012. t Systems and Industrial Engineering Department, The University of Arizona, Tucson, Arizona 85721. Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, Texas 78712-I084. § Department of Electrical Engineering and Systems Research Center, The University of Maryland, College Park, Maryland 20742.
منابع مشابه
The Effect of Measurement Errors on the Performance of Variable Sample Size and Sampling Interval Control Chart
The effect of measurement errors on adaptive and non-adaptive control charts has been occasionally considered by researchers throughout the years. However, that effect on the variable sample size and sampling interval (VSSI) control charts has not so far been investigated. In this paper, we evaluate the effect of measurement errors on the VSSI control charts. After a model development, the effe...
متن کاملErgodic and adaptive control of hidden Markov models
A partially observed stochastic system is described by a discrete time pair of Markov processes. The observed state process has a transition probability that is controlled and depends on a hidden Markov process that also can be controlled. The hidden Markov process is completely observed in a closed set and only observed through the other process in the complement of the closed set. Initially a...
متن کاملAnalysis of an Adaptive Control Scheme for a Partially Observed Controlled Markov Chain*
We consider an adaptive finite state controlled Markov chain with partial state information, motivated by a class of replacement problems. We present parameter estimation techniques based on the information available after actions that reset the state to a known value are taken. We prove that the parameter estimates converge w.p.1 to the true (unknown) parameter, under the feedback structure in...
متن کاملFa 7 - 8 : 50 a Methodology for the Adaptive Control of Markov Chains under Partial State Information
We consider a stochastic adaptive control problem where complete state information is not available to the controller. The system is modelled as a finite stochastic automaton (FSA) [PAZ], [DOB]. These models are a slight generalization of the more common partially observable controlled Markov chain models as presented in, e.g. [BE], [KV]. A controlled FSA is described by the quintuplet (X,Y,U,{...
متن کاملA Uniformly Convergent Adaptive Particle Filter
Particle filters are Monte Carlo methods that aim to approximate the optimal filter of a partially observed Markov chain. In this paper, we study the case in which the transition kernel of the Markov chain depends on unknown parameters: we construct a particle filter for the simultaneous estimation of the parameter and the partially observed Markov chain (adaptive estimation) and we prove the c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007